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DNA SEQUENCING AND GENE IDENTIFICATION 



CROSS REFERENCE TO RELATED APPLICATION 

Reference is made to commonly assigned, co-pending U.S. Patent 

5 Application Serial Number by Yang et al., (Docket 83965) filed 

entitled "Method For DNA Sequencing and Gene 

Identification". 



FIELD OF THE INVENTION 

1 o This invention relates to a method for identifying a target DNA 

molecule. 



BACKGROUND OF THE INVENTION 

With the human genome project moving to the post genomic 

1 5 sequencing era, techniques such as single nucleotide polymorphism analysis, 
genomic function analysis, andproteome analysis have found wide spread 
applications. However, important technical challenges remain such as DNA 
sequencing or gene identification speed, length of the DNA that can be read 
during a single sequencing run, and the amount of nucleic acid template required. 

20 These factors suggest the preference of sequencing the genetic information of 
single cells without prior amplification and without prior need to clone the 
genetic materials into sequencing vectors. Practical methods in single molecule 
detection (SMD) for sequencing DNA or identifying characteristic genetic 
segments in a single chromosome, with high speed, highly-automated, and long 

25 read lengths are highly needed. 

There are two traditional techniques for sequencing DNA: 1) the 
dideoxy termination method developed by Sanger et al. (Proc. Natl. Acad. Sci. 
U.S.A. 74, 5467 (1977)), and 2) the Maxam-Gilbert chemical degradation method 
developed by Maxam and Gilbert (Proc. Natl. Acad. Sci. U.S.A. 74, 564 (1977)). 

30 Both methods involve either ultrathin slab gel electrophoresis or capillary array 
electrophoresis techniques, which are labor-intensive and time-consuming, and 
require extensive pretreatment of the sample DNA. More recently, methods using 



dyes or fluorescent labels associated with the terminal nucleotide have been 
developed; however, the sequencing is still done with gel electrophoresis and 
automated fluorescent detectors. 

Soper et al., in U.S. Patent No. 5,846,727, have disclosed a 
5 method that uses a single-mode optical fiber to direct the excitation light to the 
capillary channel, and the fluorescence signals are detected with a second single- 
mode optical fiber. The Soper et al. method requires polymerase chain reaction 
(PCR) amplification of a template DNA, and purification and gel electrophoresis 
of oligonucleotide sequencing ladders prior to initiation of the separation 
1 0 reaction. These procedures require significant quantities of a target DNA. 

Several attempts towards single molecular DNA sequencing or 
detection have been made. For example, Goodwin et al. in "Application of Single 
Molecule Detection to DNA Sequencing" Nucleos. Nucleot. 16, 543, (1991), 
described a method of using DNA polymerase to synthesize a complete 
1 5 complementary strand which incorporates four different fluorescently labeled 
deoxyribonucleotide triphosphate (dNTP) analogs, and sequentially releases 
individual fluorescently labeled dNTPs using exonuclease. In this method, both 
polymerase and exonuclease have to show activity on a highly modified DNA 
strand, and a DNA strand substituted with four different fluorescent dNTP has to 
20 be generated. 

In addition, the previous attempts in single molecular DNA 
sequencing, as disclosed in U.S. Patents 5,209,834, 4,962,037 and 5,405,747, all 
use fluorescent molecules as labels, and thus have to face the difficulties in single 
fluorescent molecule detection techniques, which are found to be quite 
25 complicated and challenging as described in U.S. Patent 6,049,380 of Goodwin et 
al. 

Other approaches to the SMD of DNA include using scanning 
probe microscopy to determine the spatial sequence of fixed and stretched DNA 
molecules on a substrate as disclosed by Hansma et al. {Science, 256, 1 180, 
30 (1 992)) . However, there is a problem with this method since the narrow spacing 
of bases in DNA molecules and the small physicochemical differences among the 
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bases has to be differentiated. It is also difficult for such a method to become fast 
and with a high throughput. 

Microfluidic systems are very important in several applications. For 
example, U.S. Patent 5,445,008 discloses these systems in biomedical research such 
as DNA or peptide sequencing. U.S. Patent 4,237,224 discloses such systems used in 
clinical diagnostics such as blood or plasma analysis. U.S. Patent 5,252,743 discloses 
such systems used in combinatorial chemical synthesis for drug discovery. U.S. 
Patent 6,055,002 also discloses such systems for use in ink jet printing technology. 

The so-called "Lab-on-a-Chip" generally refers to a 
microfabricated device of microfluidic systems that regulate, transport, mix and 
store minute quantities of liquids rapidly and reliably to carry out desired 
physical, chemical, and biochemical reactions in large numbers. These devices 
have been disclosed in U.S. Patents 5,876,675; 6,048,498, and 6,240,790 and WO 
publication 01/70400. One of the most important issues in the lab-on-a-chip 
devices is the moving and mixing of multiple transport fluids inside the chip in a 
controlled fashion. Several methods of transferring and controlling liquids have 
been disclosed by U.S. Patents 6,192,939 and 6,284,113 and by publications WO 
01/01025 and WO 01/12327. These methods involve either electrokinetic 
transport mechanisms or controlling applied pressure or vacuum. 

It is an object of this invention to provide a method for single 
molecule identification of a target DNA molecule. 

SUMMARY OF THE INVENTION 

This and other objects are achieved in accordance with this 
invention which comprises a method for single molecule identification of a target 
DNA molecule in a random coil state comprising the following steps: 

a) attaching an optically distinguishable material to a DNA 
sequence recognition unit; 

b) hybridizing the DNA sequence recognition unit to the target 
DNA molecule in a random coil state to form a hybridized DNA complex in a 
random coil state; 



c) passing the hybridized DNA complex in a random coil state 
from a reservoir in a microfluidic device through a narrow channel to cause an 
acceleration of flow through the channel, thereby causing the hybridized DNA 
complex to extend into a substantially linear configuration; and 
5 d) detecting the optically distinguishable material in a sequential 

manner along the substantially linear hybridized DNA complex, thereby 
identifying the target DNA molecule. 

By use of the invention, a SMD of a target DNA molecule can be 
identified in a fast and efficient manner. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. la is a schematic representation of a microfluidic device with a 
narrow channel and a reservoir on each side. 

Fig. lb is a photograph of a microfluidic device described in Fig. 

15 la. 

Fig. lc shows photographic images of ^-bacteriophage DNA 
passing through the microfluidic device of Fig. la. 

Fig. 2 is a schematic representation showing how a target DNA in 
a random coil state can be stretched and hybridized with a series of DNA 
20 recognition units conjugated with optically distinguishable materials. 

Fig. 3a is a schematic representation of a check valve microfluidic 

device. 

Fig. 3b is a photograph of the microfluidic device of Fig. 3 a 
Fig. 3 c shows photographic images of ^-bacteriophage DNA 
25 passing through the microfluidic device of Fig. 3b. 

DETAILED DESCRIPTION OF THE INVENTION 

The international collective effort on whole genome sequencing of 
various organisms has resulted in the deposition of hundreds of bacterial and viral 
30 genome sequences into a gene bank data base. The establishment of such a 

publicly accessible data base make it extremely easy to get access to the whole 
genome sequence of many disease bacteria and viruses through their accession 
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numbers, e.g., gram-negative bacterium Escherichia coli 0157:H7 strain 
EDL933, as described in the January 25, 2001 issue of Nature (accession number 
AE005177), and gram-positive bacterium Bacillus subtilis, as described in the 
November 20, 1997 issue of Nature (accession number AL009126). Once a 
5 bacterium or virus genome sequence is known, it is possible to design multiple 
gene or DNA sequence recognition units, which are specifically, targeted on the 
unique nucleic acid fragments of the bacterium or virus genome. Such a designed 
gene or DNA sequence recognition unit can be easily made using an automatic 
DNA synthesis machine and covalently attached to an optically distinguishable 

1 0 material. Therefore, there exists a library, which contains known DNA sequence 
recognition units. 

A DNA molecule consists of four bases, A, T, G, and C, which are 
connected in linear manner covalently. The interaction among four bases follows 
the "Watson-Crick" base paring rule of A to T and G to C mediated by hydrogen 

1 5 bonds. When two single strand DNA molecules having a perfect "Watson-Crick" 
base paring match, they are referred as a complementary strand. The interaction 
between two complementary strands is termed hybridization. Sometimes 
complementary strands may contain one or more base-pairing mismatches as 
well. 

20 The present invention provides a novel approach to the SMD of a 

DNA molecule utilizing a known library of DNA sequence recognition units 
attached to a variety of optically distinguishable materials. When such optically 
distinguishable material attached DNA sequence recognition units are allowed to 
hybridize to a target DNA molecule intended to be identified, a series of optically 

25 distinguishable materials will associate with a target DNA molecule at a specific 
sequence location through hybridization between DNA sequence recognition 
units and their complementary sequence fragment on the target DNA molecule. 
When the hybridized target DNA molecule is stretched from a random coil to a 
linear state, then the optically distinguishable material can be determined in a 

30 linear sequential manner. Therefore the genetic sequence information and the 
identity of the target DNA molecule can be obtained. 
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Some commonly used DNA sequence recognition units which can 
used in the invention include, for example, DNA and DNA fragments, synthetic 
oligonucleotides, and peptide nucleic acids. In another embodiment of the 
invention, the DNA sequence recognition units can be any protein scaffold or 
5 synthetic molecular moiety capable of recognizing a specific DNA sequence. 

The invention can be used to rapidly identify bacteria or viruses 

and genes. 

Optically distinguishable materials which can be used in the 
invention include, for example, colored microparticles, such as, dyes, dye 
10 aggregates, pigments or nanocrystals; or microparticles, such as polymers or 

inorganic materials, having different shapes, such as curvilinear, spherical, donut 
shaped, elliptical, cubic, rod, etc. In a preferred embodiment of the invention, the 
optically distinguishable material comprises polymeric microparticles colored 
with a dye. 

15 A method for coloring a microparticle has been described by L.B. 

Bangs in "Uniform Latex Particles; " Seragen Diagnostics Inc. 1984, the 
disclosure of which is hereby incorporated by reference. Another approach to 
coloring a microparticle with dye is by covalently coupling one or more dyes to 
the surface of the microparticles. Examples for this approach can be found in US 

20 Patents 5, 1 94,300 and 4,774, 1 89, the disclosures of which are hereby 

incorporated by reference. Colorants and pigments can also be incorporated into 
microparticles using micro-encapsulation methods as described in U.S. Patents 
5,073,498 and 4,717,655, the disclosures of which are hereby incorporated by 
reference. These methods can be performed by anyone skilled in the art. 

25 Suitable methods for preparing polymeric particles are emulsion 

polymerization, as described in "Emulsion Polymerization" by I. Piirma, 
Academic Press, New York (1982) or by limited coalescence as described by T. 
H. Whitesides and D. S. Ross in J. Colloid Interface Science, vol. 169, pages 48- 
59, (1985), the disclosures of which are hereby incorporated by reference. The 

30 particular polymer employed to make the particles or microparticles is usually a 
water immiscible synthetic polymer that may be colored, such as any amorphous 
water immiscible polymer. Examples of polymers that are useful include 
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polystyrene, poly(methyl methacrylate) and poly(butyl acrylate). Copolymers 
such as a copolymer of styrene and butyl acrylate may also be used. In a 
preferred embodiment of the invention, the microparticles have a particle size of 
from about 0.001 pm to about 10 pm, preferably from about 0.05 pm to about 1 
5 pm. 

In another preferred embodiment of the invention, the DNA 
sequence recognition units are chemically attached to the optically distinguishable 
materials. The attachment of DNA sequence recognition units to the surface of 
microparticles can be performed according to the published procedures in the art 

10 (Bangs Laboratories, Inc, Technote #205). Some commonly used attachment 
groups on the surface of the microparticles include carboxyl, amino, hydroxyl, 
hydrazide, amide, chloromethyl, epoxy, aldehyde, etc. 

Other methods of attachrng the optically distinguishable materials 
with DNA sequence recognition units include the use of bioactive links such as 

1 5 Biotin-Strepavidin bonding or antigen-antibody bonding. 

In another preferred embodiment of the invention, more than one 
pair of optically distinguishable materials and their conjugated DNA sequence 
recognition units are used in determining or identifying the characteristic genomic 
information of a DNA molecule. 

20 The term, "microfluidic", "microscale" or "microfabricated" 

generally refers to structural elements or features of a device, such as fluid 
channels, chambers or conduits, having at least one fabricated dimension in the 
range of from about 0.1 pm to about 500 pm. In devices used in the present 
invention, channels or chambers in the device are present which preferably have 

25 at least one internal cross-section dimension, e.g., depth, width, length, diameter, 
etc., between about 0. 1 pm to about 500 pm, preferably between about 1 pm to 
about 300 pm. 

The microfluidic devices used in this invention are preferably 
fabricated using the techniques commonly associated with the semiconductor 
30 electronics industry, e.g., photolithography, dry plasma etching, wet chemical 

etching, etc., on the surface of a suitable substrate material, such as silicon, glass, 
quartz, ceramics, as well as polymeric substrates, e.g., plastics. In a preferred 
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embodiment of the invention, microfiuidic devices typically comprise two or more 
layers of fabricated components that are appropriately mated or joined together. 

Various techniques using chip technology for the fabrication of 
microfiuidic devices, and particularly micro-capillary devices, with silicon and glass 
5 substrates have been discussed by Manz, et al. {Trends in Anal. Chem. 1990, 10, 144, 
and Adv. In Chromatog. 1993, 33, 1), the disclosure of which is hereby incorporated 
by reference. Other techniques such as laser ablation, air abrasion, injection molding, 
embossing, etc., are also known to be used to fabricate microfiuidic devices, 
assuming compatibility with the selected substrate materials. 

1 o In the invention, DNA molecules are being stretched from a random 

coil configuration to a substantially linear state by passing through a micro-fluidic 
device. Large DNA molecules, like all macromolecules, have a random coil 
configuration under a non-perturbed condition. However, large DNA molecules can 
be stretched to a linear state by applying a microscopic force such as the 

1 5 hydrodynamic forces that can be generated by macroscopic or microfiuidic flows. 
These flows can be generated by using a microfiuidic device, which can be driven 
electrophoretically, electro-osmotically, or by external pressure. When a large DNA 
molecule in solution passes with an elongational flow associated with acceleration of 
the fluid from a reservoir into a microfiuidic channel, the DNA molecule can be 

20 oriented and stretched a linear state in the direction of the flow for at least a fraction 
of a second. 

In Fig. la, a microfiuidic device is shown to have a microfiuidic 
channel 10, and a fluid reservoir 20 connecting to each end of the channel. The fluid 
reservoir 20 also connects with either a fluid inlet 100 or a fluid outlet 200. A 
25 photograph of a part of the device is also shown in Fig. lb. The width and depth of 
the microfiuidic channels are from about 0. 1 pan to 1000 pm, preferably from 1 pm 
to 500 urn. 

When a ^-bacteriophage DNA, which has 48,502 base pairs, flows 
from the microfiuidic reservoir (point A in Fig. la), along the channel centerline and 
30 into the channel, the DNA molecule is extended and stretched from a random coil 
configuration, which has a size (i.e., radius of gyration) of just under 1 pm in 
quiescent solution, to a substantially linear state as shown in Fig. lc. Through the 



appropriate choice of flow geometry, temperature, and solvent conditions, the 
recovery of the equilibrium and the transition to a coiled configuration may be 
delayed for more than several seconds. 

Fig. 2 schematically shows how to use a mixture of such optically 
5 distinguishable materials conjugated with DNA sequence recognition units to 
identify bacterial or viral chromosomal DNA. First of all, a chromosomal DNA 
from a bacterium or virus was isolated and stretched from random coil state to a 
linear state. This can be done by using one of the DNA stretching methods as 
described above. Secondly, a mixture of optically distinguishable materials 
1 0 conjugated with DNA sequence recognition units with sequences complementary 
to some gene fragment sequences of the target DNA intended to be identified was 
allowed to hybridize with linear stretched DNA. Thirdly, upon the completion of 
the hybridization event, the order of optically distinguishable materials hybridized 
to the linearly stretched target DNA was determined. Since each bacterium or 
1 5 virus has its unique chromosomal DNA sequence, the order determination of the 
optically distinguishable markers should unambiguously detect a bacterium or 
virus intended to be identified. 

The following examples are provided to illustrate the invention. 
EXAMPLES 

20 Example 1 

This example illustrates the attachment of a pre-synthesized single 
strand oligonucleotide as a DNA sequence recognition unit to the surface of a 
microparticle, and the detection of a fluorescence signal due to the hybridization 
between a DNA recognition unit on the surface of such modified microparticles 
25 and its fluorescently labeled complementary single strand target DNA, in order to 
demonstrate the feasibility of the invention. 

One hundred microliters of microparticle (4% w/v) was rinsed 
three times in an acetate buffer (0.01 M, pH5.0), and combined with one hundred 
microliters of 20 mM 2-(4-Dimethylcarbomoyl-pyridino)-ethane-l -sulfonate and 
30 ten percent of polyethyleneimine. The mixture was agitated at room temperature 
for one hour and rinsed three times with sodium boric buffer (0.05 M, pH8.3). 
The beads were re-suspended in a sodium boric buffer. 
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A 22-mer oligonucleotide DNA sequence recognition unit with 5'- 
amino-C6 modification was dissolved in one hundred microliters of sodium boric 
buffer to a final concentration of 40 nmol. 20 microliters of cyanuric chloride in 
acetonitrile was added to the DNA sequence recognition unit solution and the 
total volume was brought up to 250 microlites using a sodium boric buffer. The 
solution was agitated at room temperature for one hour and then dialyzed against 
one liter of boric buffer at room temperature for three hours. 

1 00 microliters of the dialyzed DNA solution was mixed with 200 
microliters of the bead suspension. The mixture was agitated at room 
temperature for one hour and rinsed three times with a sodium phosphate buffer 
(0.01 M, pH7.0). 

A 22-mer oligonucleotide DNA with a 5 '-fluorescein label, which 
has a complementary sequence to the 22-mer DNA sequence recognition unit, 
was dissolved in a hybridization solution (6XSSPE-SDS) containing 0.9 M NaCl, 
0.06 M NaH 2 P0 4 , 0.006 M ethylenediamine tetraacetic acid, and 0.1% SDS, pH 
7.6 to a final concentration of 1M. The 22-mer oligonucleotide DNA sequence 
recognition unit attached to the microparticle was hybridized in the hybridization 
solution starting at 68°C and slowly cooled down to room temperature. 
Following hybridization, the microparticles were washed in 0.5XSSPE-SDS for 
15 minutes three times. The fluorescence image of the microparticles was 
obtained using an Olympus BH-2 microscope (Diagnostic Instruments, Inc. SPOT 
camera, CCD resolution of 1315 x 1033 pixels) with DPlanapo40 UV objective, 
mercury light source, blue excitation & barrier filters. 

The above example demonstrates the feasibility of coupling a 
DNA recognition unit, a 22-mer synthetic oligonucleotide, to an optically 
distinguishable material-microparticle, and the capability of detecting the 
hybridization event between the DNA recognition unit and a sequence 
complementary target DNA molecule, a 22-mer oligonucleotide DNA with 5'- 
fluorescein label. 

Furthermore, a dye can be incorporated into the microparticles as 
described above to produce population and sub-population of optically 
distinguishable materials, which subsequently can be coupled to different DNA 
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recognition units. Since it has been demonstrated that such a DNA recognition 
unit associated with an optically distinguishable material can hybridize to a target 
DNA molecule with a complementary sequence, using one of the methods to 
stretch a DNA molecule, the hybridization complex can be stretched into a linear 
5 configuration to allow the detection of a series of optically distinguishable 
materials in a sequential manner along the linear hybridized DNA complex, 
thereby identifying the target DNA molecule. 

Alternatively, a target DNA molecule can also be stretched first, 
and then hybridized with a series of corresponding DNA recognition units 
10 coupled to the optically distinguishable materials. Variations of actual operation 
procedure can be modified by one skilled in the art. 

Example 2 

In this example, an alternate geometry of a microfluidic device, known 
15 as a microfluidic check valve, is shown in Fig. 3a. Such a microfluidic device has a 
free-floating central element that controls the direction of the flow. The photograph 
of the microfluidic device is shown in Fig. 3b. When a solution of fluorescence 
labeled ^-bacteriophage DNA molecules, which have 48,502 base pairs, flows 
through the device in the direction indicated in Fig. 2b, the configuration of the DNA 
20 molecules at different locations of the device indicated in Fig. 3b are shown in Fig. 3 c 
under a fluorescence microscope. The DNA molecules in the regions of narrow 
channels (such as A and F), where the elongational flows accelerate, were stretched 
out to near full extension, and the relaxation of the DNA from an extended linear 
state back to its equilibrium coiled configuration, was relatively slow, taking more 
25 than a few seconds. 

The invention has been described in detail with particular 
reference to certain preferred embodiments thereof, but it will be understood that 
variations and modifications can be effected within the spirit and scope of the 
invention 
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